Passive Classification of Source Printer using Text-line-level Geometric Distortion Signatures from Scanned Images of Printed Documents
نویسندگان
چکیده
In this digital era, one thing that still holds the convention is a printed archive. Printed documents find their use in many critical domains such as contract papers, legal tenders and proof of identity documents. As more advanced printing, scanning and image editing techniques are becoming available, forgeries on these legal tenders pose a serious threat. Ability to easily and reliably identify source printer of a printed document can help a lot in reducing this menace. During printing procedure, printer hardware introduces certain distortions in printed characters’ locations and shapes which are invisible to naked eyes. These distortions are referred as geometric distortions, their profile (or signature) is generally unique for each printer and can be used for printer classification purpose. This paper proposes a set of features for characterizing text-line-level geometric distortions, referred as geometric distortion signatures and presents a novel system to use them for identification of the origin of a printed document. Detailed experiments performed on a set of thirteen printers demonstrate that the proposed system achieves state of the art performance and gives much higher accuracy under small training size constraint. For four training and six test pages of three different fonts, the proposed method gives 99% classification accuracy.
منابع مشابه
رفع اعوجاج هندسی متون بهکمک اطلاعات هندسی خطوط متن
Document images produced by scanners or digital cameras usually have photometric and geometric distortions. If either of these effects distorts document, recognition of words from such a document image using OCR is subject to errors. In this paper we propose a novel approach to significantly remove geometric distortion from document images. In this method first we extract document lines from do...
متن کاملFabrication of New 3D Phantom for the measurement of Geometric Distortion in Magnetic Resonance Imaging System
Introduction: Geometric distortion, an important parameter in neurology and oncology. The current study aimed to design and construct a new three-dimensional (3D) phantom using a 3D printer in order to measure geometric distortion and its 3D reproducibility. Material and Methods: In this study, a new phantom ...
متن کاملFabrication of New 3D Phantom for Measuring Geometric Distortion in Magnetic Resonance Imaging System
Introduction: Geometric distortion is a major shortcoming of magnetic resonance imaging (MRI), which has an important influence on the accuracy of volumetric measurements, an important parameter in neurology and oncology. Our goal is to design and construct a new three- dimensional phantom using a 3D printer in order to measure geometric distortion and its reproducibility in...
متن کاملLocal Binary Patterns for Printer Identification based on Texture Analysis
This paper proposes a texture analysis of the printed document based on Local Binary Pattern (LBP) descriptor for the application of printer identification. The LBP provides a statistical description of the pixels’ gray level differences within their neighborhoods. The occurrence histogram of local binary patterns is able to capture the document’s texture modifications by the distortion during ...
متن کاملScript Identification of Text Words from a Tri Lingual Document Using Voting Technique
In a multi script environment, majority of the documents may contain text information printed in more than one script/language forms. For automatic processing of such documents through Optical Character Recognition (OCR), it is necessary to identify different script regions of the document. In this context, this paper proposes to develop a model to identify and separate text words of Kannada, H...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1706.06651 شماره
صفحات -
تاریخ انتشار 2017